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ABSTRACT 


During  1943  and  the  early  part  of  1944,  men  processed  at  the  New 
London  Submarine  Base  were  given  a  battery  of  tests  which  included  a  Two- 
Hand  Coordination  Test,  the  Otis  Self -Administering  Tests  of  Mental  Ability f 
the  Personal  Inventory,  the  New  London-NDRC  Confidential  Questionnaire,  and 
the  NRG  Neurotic  Inventory*  These. test  sooree,  along  with  school  grades  ard 
a  set  of  trait-ratings  made  by  submarine  officers,  were  later  made  avai?.ab:9 
to  Project  N-106  for  study  and  analysis*  This  analysis  involved  the  coor ela¬ 
tion  of  test  scores  and  school  grades  with  a  selection  of  trait-ratings  in 
order  to  determine  the  prediotive  value  of  these  measures* 

As  a  result  of  these  analyses  the  following  conclusions  were  reached; 

1*  Soores  on  the  Personal  Inventory  and  the  other  selection 

measures  were  found  to  be  unrelated  to  quality  of  performance 
aboard  submarines  as  it  was  rated  by  officers*  In  a  similar 
manner,  an  analysis  of  psychiatrists*  evaluations  of  men 
judged  to  be  "emotionally  unstable"  suggests  that,  for  this 
sample  at  least,  these  evaluations  are  no  more  closely 
related  to  officers’  ratings  than  those  made  by  the  paper  and 
pencil  tests* 

Evidence  gleaned  from  earlier  investigations  would  suggest 
that  the  present  findings  are  by  no  means  typioal  —  the  PI, 
as  well  as  the  other  measures  of  the  selection  battery,  has 
been  shown  to  be  oapable  of  making  reliable  discriminations 
among  oertain  types  of  groups*  The  lack  of  correlation 
reported  in  the  present  study  can  be  attributed  to  the  faot 
that  (a)  the  sample  dealt  with  is  seleot  with  respeot  to 
mental,  physical,  and  emotional  characteristics;  (b)  the 
criterion  is  undoubtedly  quite  unreliable;  and  (o)  the 
oriterion  is  not  entirely  appropriate. 

2*  An  analysis  of  psychiatrists'  evaluations  of  men  judged  to 
be  "emotionally  unstable"  suggests  that,  for  this  sample 
at  least,  these  evaluations  are  no  more  valid  than  those 
made  by  the  paper  and  pencil  tests* 

3*  Grades  made  in  the  submarine  training  courses  oan,  to 
a  certain  extent  be  used  in  predicting  ratings  made  by 
submarine  offioers.  Likewise,  oertain  of  the  advanced 
school  grades  —those  received  after  Diesel,  Battery  and 
Gyro,  Ordnanoe,  Sound,  and  Radio  Training  —  also  offer 
possibilities  as  predictors* 

4*  Since  grades  offer  the  most  by  way  of  possibilities  for 
improved  prediction,  it  is  suggested  that  further  work 
be  directed  toward  the  improvement  of  both  the  sohool 
grade  and  the  oriterion* 


AN  EVALUATION  OF  THE  PERSONAL  INVENTORY  AND  CERTAIN 
OTHER  MEASURES  IN  THE  PREDICTION  OF 
SUBMARINE  OFFICERS*  EVALUATIONS  OF 
ENLISTED  MEN 


I.  INTRODUCTION 

The  Personal  Inventory  and  other  psyohologioal  tests  employed  at  the 
U.  S,  Submarine  Base  at  New  London,  Connecticut,  have  been  evaluated  by 
members  of  the  staff  of  NDRC  Project  N-113  working  cooperatively  with  the 
offioers  of  the  liedioal  Research  Laboratory  at  the  Base,!  In  making  these 
studies  various  criteria  we  re  used:  psyohiatric  evaluations,  offioers* 
ratings  of  experienced  submarinemen,  and  ratings  of  escape  tank  perf ormance,. 
The  present  analysis  extends  these  evaluations  to  a  number  of  other  tests 
and  to  the  grades  assigned  submarinemen  at  the  termination  of  their  basic 
and  advanced  training;  in  this  instance,  ratings  made  by  submarine  officers 
are  vised  as  criteria* 


II.  THE  DATA 

The  data  whioh  form  the  basis  of  this  Report  were  taken  from  the  card 
files  of  the  Medioal  Research  Laboratory  at  the  New  London  Submarine  Base 
■where  the  men  had  been  processed*  bhen  the  work  of  Project  N-113  was  ter¬ 
minated,  these  data  were,  at  the  request  of  the  Applied  Psychology  Panel, 
turned  over  to  Project  N-106  for  analysis  and  study. 

These  record  cardB  provide  some  twenty-eigjvt  items  of  information 
(test  Boores,  grades,  eto.)  which  include: 

1*  Ago  —  recorded  as  of  last  birthday  at  the  time  of  testing, 

2*  Grade  in  Basic  Training  —  recorded  in  terms  of  the  0-4  Navy  scale* 

3*  Class  standing  in  Basic  Training  —  recorded  in  terms  of  percentage 
position  as  oomputed  by  means  of  the  formula  (rank  -  ,5)  100  • 

- N - 

4-20*  Grade  in  Advanced  Training  —  reoorded  in  terms  of  the  0-4  Navy 
scale  for  the  following  advanoed  schools: 

4*  Diesel  10*  Sound  16*  Quartermaster 

6,  Battery  and  Gyro  12,  Radio  18,  Pharmacist’s  Mate 

8,  Ordnanoe  14.  Radar  20.  Lookout 

1.  NbRC  Project  44,  biv,7,  A  keport  of  Research  on  Selection  Tests  at  the 

U.S. Submarine  Base,  New  London.  OSRD  Report  No.  1770,  Providence,  k.I,, 
Brown  University,  1943.  Pp.&5,  NDRC  Projeot  N-113,  Final  Report  in  Stanmary 
of  Work  on  the  Personal  Inventory  and  Other  Tests,  DBHd  Report  No. 3963, 
Providence,  k.I.,  Brown  University,  1944.  Pp.13 


5-21,  Class  standing  in  Advanced  Training  —  recorded  for  each  of 
the  nine  schools  in  terms  of  percentage  position  as  computed 
by  means  of  the  formula  (rank  -  .5)100  . 

- n - 

22.  Ratings  on  Escape  Tank  Performance  —  recorded  in  terms  of  a 
1  to  5  rating. 

23.  Score  on  the  Two-Hand  Coordination  Test.  Soores  represent  the 
total  amount  of  time  "spent  in  contaot"  on  a  motor  pursuit 
task.2 3 4 5 6 7 

24.  Score  (IQ)  on  the  Otis  Self -Administering  Tests  of  Mental 
Ability.  Higher  Examination;  Porm  B.® 

25.  Score  on  the  Personal  Inventory,  Format  B.  This  is  a  group 
test  of  145  items  which  Project  N-113  developed  for  use  in 

the  preliminary  screening  of  "psychiatrically  undesirable  men'  ' 

26.  Score  on  the  New  London-NDRC  Confidential  Questionnaire.  Th? a 
questionnaire,  whioh  samples  various  aspects  of  the  individua 1 J n 
background,  was  developed  at  New  London  end  validated  on  a  sub¬ 
marine  sohool  group  by  using  psychiatric  evaluations  and  tank 
performance  as  criteria.® 

27.  Soore  on  the  NRC  Neurotic  Inventory,  Farm  R-2.  This  inventory 
consists  of  forty-seven  items  sampling  "likes  and  dislikes" 
and  "wishes  and  worries". 

28.  Ratings  of  proficiency  made  by  submarine  officers.  By 
directive,  ratings  were  made  on  a  seven-trait  graphic  scale 
and  were  returned  to  the  Medical  Research  Laboratory  where 
they  were  entered  on  the  record  card.  (See  Illustration  1.)^ 
These  ratings  were  collected  with  the  end  in  view  of  later 
using  them  in  the  evaluation  of  the  selection  measures. 

These  items  of  information,  in  more  or  less  complete  form,  were  made 
available  for  some  1600  men  representing  40  submarines  of  the  Atlantic  Fleet 
and  58  of  the  Pacific.  These  men  (a)  had  been  processed  at  the  Base  during 
1943  and  the  early  part  of  1944,  (b)  were  sent  to  sea  as  members  of  a  submarine 
orew,  and  (c)  were  rated  on  the  seven-trait  graphic  scale  after  having  com¬ 
pleted  at  least  one  war  patrol.  Thus,  for  this  group  of  1600  men  the  New 


2.  NDRC  Project  44,  Div.7,  A  Report  on  Research  on  Selection  Tests  at  the 
U.S. Submarine  Base,  New  London.  QSRD  Report  too .  1770.  Providence,  R.I., 
Brown  University,  1943.  P.  5. 

3.  Otis,  A.S.,  Otis  Self-Administering  Tests  of  Mental  Ability.  Higher 
Examination,  Form  B.  Yonkers-on-Hudson,  N.Y.,  World  Book  Co.,  19^2. 

4.  HDRC  Project  N-113,  op.  cit.,  p.  i. 

5.  NDRC  Project  44,  Div.  7,  op.  cit.,  p.  5 

6.  Ibid,  p.5 

7.  See  Section  III,  "The  Criterion,"  for  a  description  and  evaluation  of 
the  scale. 


London  records  provided  enough  data  to  permit  the  evaluation  of  test  scores 
and  grades  as  prediotors  of  the  ratings  later  made  by  submarine  officers* 

III.  PROCEDURE 

The  prooedure  which  was  followed  in  evaluating  the  predictive  value  of 
the  measures  described  above  is,  in  general*  as  follows: 

(1)  The  ratings  made  by  the  submarine  officers  of  the  98  vessels  were 
studied  for  the  purpose  of  selecting  those  vessels  whose  ratings 
were  best  suited  for  use  as  criteria. 

(a)  All  the  ratings  from  each  of  the  vessels  were  tabulated. 

(b)  On  the  basis  of  these  tabulations,  those  vessels  whose 
ratings  lacked  "spread"  (where  a  large  proportion  of  the 
ratings  fell  in  one  category  of  the  scale)  were  discarded 
from  the  sample. 

(c)  Inter cor relations  among  the  trait  ratings  from  those 
vessels  aocepted  under  "b"  above  were  computed  and  factor 
analyzed. 

(d)  The  results  of  the  factor  analysis  were  used  as  the  basis 
for  selecting  certain  of  the  trait  fcatings  for  use  as 
criteria  in  studying  the  validity  of  the  selection  and 
achievement  measures. 

(2)  Test  scores  and  school  grades  were  correlated  with  the  ratings 
on  the  selected  traits.  In  this  analysis  the  Atlantio  and 
Pacifio  fleets  were  treated  independently. 

The  findings  whioh  resulted  from  oarrying  out  these  steps  are  dis¬ 
cussed  in  the  next  two  sections.  The  study  of  the  criterion  measure  and 
the  selection  of  ratings  is  described  in  Section  IV  below,  and  statistical 
evaluations  of  the  selection  tests  and  school  grades  are  presented  in 
Section  V. 


IV.  THE  CRITERION 

In  evaluating  the  measures  used  as  criteria  in  this  study,  it  is 
well  to  bear  in  mind  the  form  which  these  evaluations  took  and  the  pro¬ 
cedures  followed  in  collecting  them.  The  ratings,  by  directive,  were 
made  and  reported  on  a  standard  form  (see  Illustration  1).  This  fonn 
required  the  submarine  offioer  to  evaluate  his  men  on  seven  traits  pre¬ 
sumed  to  be  related  to  performance  on  board  the  submarine.  These  traits 
were:  (l)  Adaptability,  (2)  Alertness.  (3)  Courage,  (4)  Sociability,  (5) 
Leadership,  (6)  Dependability,  and  (7)  Adequacy  of  Overall  Performance. 
The  officer  made  his  evaluations  on  a  three-point  graphic  scale,  the 


Return  to:  Medical  Research  Laboratory 
Submarine  Base 
New  London,  Connecticut 


Name  Service  No.  Rate 

_ ,  1944. 

Ship  Date 


This  report  covers  ^ _ weeks 

of  patrol,  and  was*completed 
by  the  f Commanding  Officer 

[Executive  Officer 
(Check  which).  Make  report 
out  in  ink. 


1.  Adaptability  To  DU- - - 

cipline:  (Willingness  to  Insubordinate;  Obeys  without 
obey  orders;  coopera-  disciplinary  comment;  does 

tiveness.)  problem.  no  more  nor 

better  than  he 
must. 


2.  Alertness:  (Indus- _ _ _ 

try,  Initiative,  Self-Im-  Highly  inter-  Normal  indus- 
provement.)  ested;  unusual  try  and  inter- 

effort  to  im-  est. 
prove  self. 

3.  Courage:  (Emotion- _ _ 

al  Stability;  Reaction  Unruffled  by  E  x  c  i  t  e  d  by 

to  Stress.)  any  danger;  danger;  but 

able  to  per-  performs  satis- 

form  well  in  factorily. 

all  situations. 


4.  Sociability:  (Ability - - 

to  get  along  with  other  Extreme  ly  Liked  quite 
men.)  well  liked;  well  by  most. 

Good  mixer; 
makes  friends 
easily. 


5.  Leadership:  (De- - ; - — - - - 

gree  to  which  respected  Ineffective  in  Occasionally  a 
and  followed.)  g  r  o  u  p  s  ;  al-  leader;  fairly 

ways  follows  well  consider- 
lead  of  others,  ed.  - 


Follows  orders 
with  despatch 
and  interest. 


Not  very  alert; 
tends  to  be 
satisfied  with 
his  lot. 


Tends  to  go  to 
pieces  under 
stress. 


Few  friends; 
generally 
avoided. 


Lead  followed 
by  all;  natural 
leader. 


6.  Dependability:  (Re- - - - 

sponsibility  and  Reli-  Can  be  relied  Reliable  under  Unreliable; 
ability.)  upon  to  per-  most  circum-  cannot  be  de¬ 

form  well  un-  stances;  rare-  pended  upon, 
der  .  any  cir-  ly  disappoints, 
c u  instances 
and  at  any 
time. 


7.  Adequacy  of  Over-  — - — ; -  TTTr^ — 7 - 

all  Performance:  Inferior  man;  Satisfactory; 

would  like  to  desire  to  re¬ 
replace.  tain. 


Extreme  ly 
valuable;  spe¬ 
cial  desire  to 
retain. 


ADDITIONAL  COMMENT 


Your  Initials  Here 


Illustration  It  Rating  Soale  Form  Used  in  Collecting 
the  data  for  the  Criterion  Measure. 


points  of  which  were  defined  by  adjectives  and  short  descriptive  phrapes. 

On  completing  the  ratings,  the  offioer  initialed  the  rating  form  and 
returned  it  to  the  Medical  Research  Laboratory  at  New  London.  Sixteen 
hundred  of  these  rating  farms  were  turned  over  to  Project  N-106  for  study 
and  analysis. 

Under  the  procedures  described  above,  it  should  be  noted  that  it  was 
not  possible  to  train  the  officers  in  rating  procedures  or  to  review  and 
follow-up  their  evaluations  periodically.  This  would  lead  one  oo  anticipate 
(in  terms  of  the  experiences  of  previous  investigators)  that  most  of  the 
errors  which  are  commonly  made  by  untrained  raters  would  be  present  in 
varying  amounts  in  our  sample  of  ratings  and  that  these  errors  would  reduoe 
the  usefulness  of  the  ratings  as  criteria.  Steps  were  thus  taken  to  dis~ 
oard  those  ratings  which  were  obviously  worthless  as  measures  of  perform¬ 
ance.  The  procedure  adopted  was  as  follows:  The  ratings  received  were 
tabulated  for  each  submarine  and  the  number  of  ratings  falling  in  each 
oategory  of  the  scale  was  determined.  These  tabulations  revealed  that 
there  was  very  little  spread  (most  of  the  ratings  fell  in  the  middle 
oategory  of  the  scale)  in  the  ratings  made  by  some  submarine  officers. 

Quite  arbitrarily,  but  in  the  interest  of  improving  the  criterion,  ratings 
from  9  Atlantio  and  6  Pacific  vessels  were  discarded.  The  standard  was 
based  on  the  percentage  of  ratings  in  the  extreme  categories  (see  Figure  l'a 
If  less  than  26  percent  of  the  ratings  of  the  men  on  a  vessel  fell  in  the 
extreme  categories,  the  ratings  for  all  the  men  from  such  a  vessel  were  dis  - 
oarded.  In  all,  the  ratings  for  150  men  from  the  Atlantic  fleet  and  67  f r :•■?>. 
the  Pacifio  were  rejected  for  this  reason.  Further,  some  cf  the  vessels 
supplied  so  few  ratings  that  it  was  felt  advisable  to  discard  these  vessels 
from  the  sample  on  the  ground  that  valid  ratings  are  difficult  to  make  ■when 
the  frame  of  reference  in  which  the  ratings  are  made  is  defined  by  a  small 
number  of  men  (see  Figure  l).  Ratings  from  two  of  the  vessels  (10  men)  from 
the  Atlanbio  fleet  and  11  (42  men)  from  the  Pacific  were  discarded  on  this 
basis.  Thus,  by  applying  these  two  criteria  in  selection  the  total  number 
of  ratings  available  for  use  as  criteria  was  reduced  from  1600  to  1400. 

The  effect  of  this  selection  an  the  statistical  characteristics  of 
the  ratings  is  illustrated  in  Tables  1  and  2.  In  general,  the  effect  of 
selection  was  to  increase  the  spread  of  the  ratings  (compare  traits  of 
Tables  1  and  2)  and,  as  one  might  anticipate,  the  intercorrelations  between 
the  individual  trait  ratings. 

The  magnitude  of  these  intercorrelations  raised  the  question  as  to 
whether  or  not  each  of  the  rating  scale  traits  should  be  used  as  an  indepen¬ 
dent  criterion  in  the  validity  analysis.  Obviously,  if  the  correlation 
between  traits  iB  high,  there  is  little  to  be  gained  by  treating  each  of 
these  traits  as  independent.  To  rationalize  a  selection  of  traits  to  be 
used,  a  faotor  analysis  of  the  rating  soale  traits  was  made.  Although  two 
factors  were  extracted  from  the  matrix  (see  Table  3),  one  of  -the  faotcrs 
contributes  very  little  to  the  variance  of  any  of  the  traits  and  should, 
perhaps,  not  have  been  extracted.  Presumably,  a  great  deal  of  "halo" 
characterizes  the  individual  trait  ratings  and  one  could  choose  almost  any 


TABLE  1 


Intercorrelations*  of  Rating  Scale  Traits 
(Accepted  Ratings) 


Trait 

2 

3 

on 

6 

m 

M 

0 

(1)  Adaptability 

.523 

.293 

.388  .393 

.495 

•480 

•  720 

8*63 

.512 

(2)  Alertness 

.329 

.427  .487 

•  593 

.624 

.788 

2.34 

.622 

(3)  Courage 

.364  .374 

.489 

.416 

•599 

2.38 

.525 

(4)  Sociability 

•412 

.495 

•  512 

.670 

2.32 

Owv.. 

(5)  Leadership 

.500 

.457 

.714 

1.94 

#Eja 

(6)  Dependability 

.671 

.819 

2.33 

.545 

(7)  Overall  rating 

.811 

2.22 

.545 

Sum  of  trait  ratings 

16.15 

2,782 

TAB  IE  2 

Intercorrelations*  of  Rating  Scale  Traits 
(Rejeoted  Ratings) 

Trait 

2 

3 

4  5 

6 

7 

Sum 

M 

0 

(1)  Adaptability 

.474 

•  168 

.211  .289 

.371 

.337 

•693 

2.32 

.476 

(2)  Alertness 

•231 

.289  .617 

.470 

.627 

.782 

2.12 

.602 

(3)  Courage 

.273  .236 

•194 

.236 

•463 

2.08 

.308 

(4)  Sociability 

.239 

.289 

•145 

.518 

2.11 

.324 

(5)  Leadership 

.441 

•  546 

.593 

1.94 

.306 

(6)  Dependability 

.469 

•642 

2.09 

•330 

(7)  Overall  rating 

.678 

2.03 

•302 

Sum  of  trait  ratings 

14.69 

1.639 

*  These  values  are  coefficients  of  contingency* 
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one  of  the  traits  to  represent  whatever  abilities,  performances,  etc.  are 
described  in  these  ratings.  Table  3  also  shows  that  only  about  half  of  the 
variance  in  the  trait  ratings  (see  h^  column)  is  accounted  for  by  the 
operation  of  the  two  factors  which  were  extracted;  it  would  seem  that  either 
the  ratings  are  very  unreliable  or  that  one  must  call  upon  specific  factors 
to  account  for  the  total  varianoe  in  the  ratings.  In  terms  of  what  is 
known  about  ratings  in  general  the  latter  explanation  seems  the  less  tenable. 

TABLE  3 

Factor  Loadings  and  Communalities 
of  the  Rating  Scale  Traits 


Trait 

kl 

*2 

h2 

(l)  Adaptability 

•  639 

-.221 

.457 

(2)  Alertness 

.745 

-.238 

.612 

(3)  Courage 

•569 

.228 

.376 

(4)  Sociability 

.642 

.097 

.422 

(5)  Leadership 

.645 

-.100 

.426 

(6)  Dependability 

.808 

.112 

.665 

(7)  Overall  Rating 

.791 

.090 

.633 

Figure  2  shows  that  it  is  impossible  to  define  two  distinct  clusters 
of  traits  to  represent  the  two  factors  extracted  —  one  would  hope  for  two 
clusters  of  points  falling  at  right  angles  to  each  other  —  and  that 
actually  there  is  very  little  justification  for  choosing  one  of  the  traits 
in  preference  to  another,  as  a  validation  criterion.  As  a  precaution,  three 
traits  (1,  3,  and  7)  as  well  as  the  sum  of  all  seven  trait  ratings,  were 
chosen  to  represent  the  abilities  and  performances  measured  by  the  soale. 
This  procedure  made  it  possible  to  eliminate  approximately  half  of  the 
work  involved  in  correlating  each  of  the  traits  with  the  selection  tests 
and  with  the  school  grades.  An  exception  was  made  in  the  case  of  the 
Personal  Inventory;  scores  on  this  test  were  correlated  with  each  of  the 
seven  trait  ratings  and  also  with  the  sum  of  the  seven  traits. 

Thus,  the  analyses  described  in  this  Section  resulted  in  a  selection 
of  ratings  with  an  improved  spread,  and  a  set  of  three  traits  which  can  be 
taken  to  represent  the  entire  scale  of  seven. 


V.  RESULTS 


A.  The  Validity  of  the  Personal  Inventory 

Scores  on  the  Personal  Inventory  were  correlated  with  the  trait 
ratings  transformed  into  numerical  values.  The  resulting  coefficients  are 
reported  in  Tables  4  and  5.®  It  is  difficult  to  arrive  at  any  conclusion 
other  than  that  scores  on  the  Personal  Inventory  are  unrelated  to  per¬ 
formance  as  it  is  described  by  officers'  ratings.  It  is  conceivable  that, 
since  the  men  were  in  part  selected  on  the  basis  of  PI  scores,  the  low 
validity  coefficients  are  the  result  of  a  curtailed  range  of  adjustment; 
this  is  in  part  borne  out  by  the  data  at  hand.  For  the  Atlantic  Sample 
the  PI  distribution  had  a  mean  of  7.78  and  a  sigma  of  4.04;  for  the 
Pacific,  mean  and  sigma  were  6.94  and  3.85,  respectively.  Both  of  these 
statistics  are  somewhat  lower  than  those  which  NDRC  Project  N-113  has 
reported®  for  other  Navy  groups  but  hardly  enough  so  as  to  account  for 
the  zero  validities  obtained. 

It  might  also  be  argued,  and  perhaps  more  reasonably,  that  the 
criterion  is  inappropriate  in  the  sense  that  the  PI  was  not  designed  to 
make  the  type  of  discrimination  which  the  present  criteria  demand  -"mental 
health",  "personal  adjustment,"  "neurotic  tendency",  or  whatever  facet  of 
personality  structure  which  is  tapped  by  the  Personal  Inventory  — is  only 
indirectly,  and  not  very  accurately,  reflected  in  the  trait  evaluations 
made  on  the  rating  scale.  Of  course,  this  type  of  argument  can  be  used 
to  rationalize  any  table  of  low  validity  coefficients  but  in  this  parti¬ 
cular  case  it  would  seem  more  than  a  rationalization.  The  PI  was 
validated  originally  on  a  psychiatric  criterion;  and  in  successive  trials, 
when  similar  criteria  were  employed,  the  instrument  stood  up  reasonably 
we  11. lilhen  many  investigators  frequently  have  had  difficulty  in  making 
personality  measures  stand  up  from  one  sample  to  another,  it  would  seem 
unjust  to  expect  the  Personal  Inventory  lor  for  that  matter,  any  similar 
instrument)  to  stand  up  from  one  criterion  to  anolher. 


8.  It  will  be  noted  that  these  values  are  based  on  a  sample  which  is  con¬ 
siderably  smaller  than  the  entire  group  dealt  with;  this  results  from 
the  fact  that  the  Personal  Inventory  was  not  administered  during  the 
entire  period  covered  by  our  sample  of  records. 

9.  NDRC  Project  N-113,  op.  cit.,  P.iii. 

10.  The  record  cards  supplied  a  psychiatrist's  evaluations  far  a  limited 
npniber  of  men  who  were  referred  to  a  psychiatrist  for  special  interview. 

In  general,  the  number  of  men  falling  in  the  four  rubrics  which  the 
psychiatrist  used  in  classifying  these  men  is  so  small  as  not  to  warrant 
extended  statistical  analysis.  Evaluations  of  some  200,  however,  who 
were  classified  as  suffering  from  various  degrees  of  "emotional  insta¬ 
bility",  were  correlated  with  the  submarine  officers'  ratings  on  three  of 
the  rating-scale  traits;  none  of  the  X^'s  attain  statistical  significance. 
There  is  little  in  the  data  which  would  suggest  that  (for  the  present 
sample)  psychiatrists'  evaluations  are  any  better  predictors  than  the  PI. 


TABLE  4 


Validity  of  the  Personal  Inventory  in 
Predicting  Submarine  Officers'  Ratings  of 
347  Enlisted  Men 
(ATLANTIC  FLEET) 


Soale  Traits 

r* 

M 

O 

(l)  Adaptability 

-.033 

2.56 

.53 

(2)  Alertness 

-.076 

2.23 

.63 

(3)  Courage 

.009 

2.26 

•48 

(4)  Sociability 

-.071 

2.27 

.52 

(5)  Leadership 

-.045 

1.88 

•46 

(6)  Dependability 

-.019 

2.18 

.52 

(7)  Overall  Rating 

.005 

2.11 

.46 

Sum  of  Raw  Ratings 

-.048 

15.49  2 

.64 

TABLE  5 

Validity  of  the  Personal  Inventory  in 
Predicting  Submarine  Officers'  Ratings  of 
683  Enlisted  Men 
(PACIFIC  FLEET) 

Soale  Traits 

r* 

M 

e 

(l)  Adaptability 

-.061 

2.67 

.49 

(2)  Alertness 

-.063 

2.42 

.60 

(3)  Courage 

-.013 

2.42 

•  53 

(4)  Sociability 

-.037 

2.35 

•54 

(5)  Leadership 

-.079 

1.95 

•  53 

(6)  Dependability 

-.037 

2.39 

•  54 

(7)  Overall  Rating 

-.068 

2*29 

•57 

Sum  of  Raw  Ratings 

-.071 

16.48  2 

.76 

*  The  signs  of  -these  coefficients  should,  of  course,  be  negative  since 
low  score  on  the  Personal  Inventory  is  a  good  one#  None  of  these  coef¬ 
ficients  attains  statistical  significance. 


Further,  the  reliability  of  the  criterion**  must  be  considered  in 
evaluating  all  of  the  validity  statistics  presented  in  this  Sec  .ion.  Un¬ 
fortunately,  the  ratings  used  as  criteria  were  made  by  only  one  officer, 
and  as  far  as  our  data  are  concerned,  only  once.  This,  of  course,  makes 
it  impossible  for  one  to  arrive  at  a  direct  statistical  measure  of  the 
consistency  of  these  evaluations.  Evidence  of  an  indirect  sort,  however, 
si^gests  that  these  ratings  are  reliable  enough  to  permit  significant  cor¬ 
relations  with  other  measures;  achievement  in  some  of  the  advanced  schools, 
as  measured  by  grades,  did  correlate  with  the  rating  data.  This  fact  per¬ 
mits  one  to  say  that  the  low  correlation  between  PI  scores  and  ratings  of 
performance  can  not  entirely  be  explained  by  the  unreliability  of  the 
criterion. 

Evaluations  reported  elsewhere  have  demonstrated  that  the  Personal 
Inventory  is  capable  of  making  reliable  discriminations  among  certain  type  : 
of  groups.  The  present  statistics,  however,  do  not  indicate  that  it  can 
predict  ratings  of  the  sort  used  in  this  study;  unfortunately,  with  the 
limited  amount  of  data  available,  it  is  not  possible  to  definitely  assign 
responsibility  for  the  low  correlations  to  the  PI,  the  criterion,  or  to 
some  factors  left  uncontrolled  in  the  investigation. 

B.  The  Validity  of  the  Other  Selection  Measures 
and  the  School  Grades 

The  other  measures,  —the  Two-Hand  Coordination  Test,  the  Otis  Self- 
Administering  Test,  the  Confidential  Questionnaire  and  the  NRC  Neurotic 
Inventory  —  perform  little  better  than  the  Personal  Inventory  (see  Tables 
6  and  7).  Of  the  32  validity  coefficients  reported  for  these  four  measures,? 
only  four  approach  statistical  significance  and  these  are  hardly  large 
enough  to  be  of  any  predictive  value.  These  results  again,  would  lead  one 
to  suspect  the  criterion  were  it  not  for  the  fact,  as  was ’pointed  out 
earlier,  that  oertain  of  the  school  grades  do  correlate  with  the  ratings# 

Grades  made  in  the  Submarine  Basic  training  courses  can,  to  a 
certain  extent,  be  used  in  predicting  submarine  officers'  ratings.  Likewise, 
certain  of  the  Advanced  School  grades,  in  particular,  those  received  after 
Diesel,  Battery  and  Gyro,  Ordnance,  Sound,  and  Radio  training,  offer 
possibilities  as  predictors  (see  Tables  6  and  7)*  In  general,  the  coef¬ 
ficients  secured  from  the  Pacific  fleet  data  tend  to  be  somewhat  higher 
(  and  more  of  them  attain  statistical  significance)  than  those  from  the 
Atlantic;  this  may  suggest  a  slight  superiority  of  the  ratings  received 
from  the  Pacific. 

In  comparing  the  validity  coefficients  which  were  obtained  for  the 
selection  measures  with  those  for  the  achievement  measures,  only  one  con¬ 
clusion  seems  to  be  possible:  if  one  accepts  the  criterion  as  valid,  the 
existing  achievement  measures  offer  more  possibilities  as  predictors  of 
submarine  performance.  More  extended  work  on  both  the  criterion  and  the 
school  grade  offers  more  by  way  of  possibilities  than  similar  work  writh 
the  existing  selection  batteryo 

li.  Statistics,  descriptive  of  the  reliability  of  the  Personal  Inventory, 
are  presented  in  the  reports  of  NDRC  Project  N-113  —see,  for  example, 
OSRD  Report  No.  3963. 


13 


-  \*1K.  '  \ 

.  ••  , 

RESTRICTED 


TABLE  6 

Validity  of  Section  end  Aohlerenent  Untune  in 
-  redioting  Submarine  Offio ere*  Ratings  of 
Enlisted  Men  -  ATLANTIC  FLEET 


Tojt  of  Achievement 
yen 3u  re 


Ape 


Da*io  Training,  Haw  Grade 
Pa  ale  Training,  Cla„  Standing 
Diefiol  Training,  Raw  Guide 
Diesel  Training,  Class  Standing 
3attt*ry  A  C>ro,  Raw  Grade 
Ba^tnry  4  Gyro,  Class  Standing 
Ordnance  Training,  Raw  Grade 
Ordnanoo  Training,  Class  Standing 
Sound  Training,  Raw  Grade  * 

Sound  Training,  Glass  Standing 
Radio  Training,  Raw  Grade 
Radio  Training,  Class  Standing 
Dunrtemn star,  Raw  Grade 
'\tnrrorHA3ter,  Class  Standing 
Lookout  Training,  Raw  Grade 
Lookout  Training,  Class  Standing  ’ 
Tank  Porformanoo 
Two-Hand  Coordination  Test 
Otis  Self-Admin  1  storing  Test 
New  London  Cofifidentlal  Question 
NRC  Neurotio  Inventory 


•  034 


(1) 

•  143* 

.104* 

-.187* 

•  322* 

-•198**  -,isa 

-•121 

e* 

-•041 

•  004 
-•092 

•  308** 

-.288 
-.063 

•  116 

-.120 

•  036 

i  p 

-.087 

•  136 
.127 
•036 
.092 

-•000 

-.001 


Rating  Soale  Trait 

(3)  (7)  Sum 

068  #  200*  , 207* 


.184*  .176*  3.20 


-.140*  -.238*  -.246*  60,08 

.199*.  .262*  .35!, 

-.249**  -.269*  62.38 


- 

20.01  3,04  360 


.095 

-.126 

-•104 

•  098 
•019 

•  030 
•021 

-.041 


-•126 

-.239  -.188 

.121 

.198  ,088 

-.027 

•037  ,049 

.017 

-.140  -.108 

•098 

.1*1  .031 

,028  .060 

.070 

.006 

NSfilBaw* 

•038  .097 

.009 

.003  -.017 

••074 

•023  -.020 

•  058  .riTfl 

-.022  -.142 

.178  .203 

-•14*  -.186 

♦  1W  .237 

-.182  -.208 

.000  .048 

-•144  .010 


104.80 


-9.88 


•  Cigmftoenfc  et  the  IX  level. 
•*  SlKnlfinmt  at  the  lev*!. 


4,1:4' 


V-  **  r  l(r. 
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TABLE  7 

Validity  of  Selection  and  Achievement  Measure  a  in 
Predicting  Submarine  Officers*  Ratings  of 
Enlisted  Men  —  PACIFIC  FLEET 


Test  of  Achievement  Rating  Scale  Trait 


Measure 

(1) 

(3) 

(7) 

Sum 

U 

<r 

V 

Age 

•  126* 

.117* 

.166* 

.174* 

20.30 

2.87 

761 

Basic  Training,  Raw  Grade 

•169* 

.169* 

.183* 

•  246* 

3.29 

•  22 

797 

Basic  Training,  Class  Standing 

-.166* 

-.082** 

-.183* 

•  •209* 

46.00 

28.77 

797 

Diesel  Training,  Raw  Grade 

.151 

.068 

.068 

.107 

3.05 

.26 

191 

Diesel  Training,  Class  Standing 

-.061 

.031 

-.070 

% -.070 

46.60 

29.16 

191 

Battery  A  Gyro,  Raw  Grade 

.299* 

.230* 

.328* 

.343* 

3.37 

•  28 

179 

Battery  A  Gyro,  Class  Standing 

-.299* 

-.214* 

-.268* 

-.289* 

46.02 

28.38 

179 

Ordnance  Training,  Raw  Grade 

.166** 

.287* 

.282* 

.361* 

3.33 

.16 

194 

Ordnance  Training,  Class  Standing 

-.097 

-.159** 

-.206* 

-.229* 

46.44 

28.93 

194 

Sound  Training,  Raw  Grade 

.160 

•  155 

.389* 

.317* 

3.28 

.19 

114 

Sound  Training,  Class  Standing 

-.146 

-.152 

-.327* 

-.281* 

49.70 

28.74 

114 

Radio  Training,  Raw  Grade 

•  242 

.203 

.588* 

.357* 

2.88 

•  56 

66 

Radio  Training,  Class  Standing 

-.292** 

-.226 

-.599* 

-.367* 

62.23 

28.14 

66 

Quartermaster,  Raw  Grade 

.176 

-.012 

-.203 

.067 

3.44 

•  22 

69 

Quartermaster,  Class  8tandi ng 

-.145 

-.0Q5 

-.145 

.073 

46.20 

27.47 

69 

Lookout  Training,  Raw  Grade 

•  191 

-.196 

•  139 

.093 

3.30 

•  25 

30 

Lookout  Training,  Class  Standing 

-.078 

•  219 

-.142 

-.055 

46.70 

25.69 

30 

Tank  Performance 

•122 

.121 

.099 

.067 

2.93 

.68 

806 

Two-Hand  Coordination  Test 

.084** 

.073 

.083** 

.110** 

94.27 

27.69 

746 

Otis- Self-Administering  Test 

•050 

-.012 

.056 

•048 

103.16 

10.06 

19  6 

Kerr  London  Confidential  Question 

-.061 

•020 

-.010 

-.013 

-8.66 

4.91 

632 

SRC  Neurotio  Inventory 

-.054 

-.041 

-.120** 

-.096 

12.24 

5.43 

443 

*  Significant  at  the  1%  level* 

**  Significant  at  the  VjL  level*. 


VI.  SUMMARY 


The  record  cards  of  the  Medical  Research  Laboratory  at  the  New  London 
Submarine  Base  supplied  the  test  scores  and  school  grades  of  some  1700  men 
who  were  later  evaluated  by  submarine  officers  on  a  seven-trait  graphic 
rating  scale.  The  present  report  attempts  to  evaluate  the  test  scores  and 
sohool  grades  as  predictors  of  later  submarine  performance. 

In  making  this  study,  the  following  procedures  were  carried  out: 

1.  The  submarine  ratings  were  subjected  to  extended  analysis  with 
the  end  in  view  of  selecting  (a)  those  vessels  whose  ratings 
offered  the  greatest  possibilities  as  criteria,  and  (b)  those 
traits  which  best  represented  the  abilities  and  performances 
measured  by  the  scale  as  a  ■whole.  Ratings  on  three  of  the 
scale  traits,  (Adaptability,  Courage  and  Adequacy  of  Overall 
Performance)  along  with  the  sum  of  the  ratings  for  all  seven 
traits,  from  29  submarines  of  the  Atlantic  Fleet  and  41  from 
the  Pacifio  were  finally  acoepted  for  use  as  criteria. 

2.  Test  scores  and  school  grades  were  correlated  with  the 
ratings  on  the  selected  traits.  Atlantic  and  Pacifio  fleets 
were  treated  independently. 

The  results  of  ihe  validity  analysis  may  be  summarized  as  follows: 

1.  Scores  cn  the  Personal  Inventory,  and  on  the  other  selection 
measures  employed  at  the  New  London  Submarine  Base,  were 
found  to  be  practioally  unrelated  to  performance  as  it  is 
described  by  officers'  ratings.  The  validity  coefficients 
are  probably  greatly  attenuated,  since  the  criteria  are  of 
questionable  reliability.  Further,  the  men  are  selected 
with  respect  to  the  abilities  investigated  here;  and  the 
criteria  may  be  considered  as  inappropriate,  in  the  sense 
that  the  items  which  comprise  the  selection  measures  were 
chosen  for  their  ability  to  make  discriminations  of  the  type 
which  may  not  at  all  be  related  to  those  called  for  by  the 
present  criteria. 

2.  An  analysis  of  psychiatrists'  evaluations  of  men  judged  to 
be  "emotionally  uns table"  suggests  that,  for  this  sample  at 
least,  these  evaluations  are  no  more  valid  than  those  made 
by  the  paper  and  pencil  tests  of  the  selection  battery. 

3.  On  the  other  hand,  grades  made  in  the  submarine  basic  train¬ 
ing  courses  can,  to  a  certain  extent,  be  used  in  predicting 
ratings  made  by  submarine  officers.  In  a  similar  manner, 
certain  of  the  advanced  school  grades  —  those  received  after 
Diesel,  Battery  and  Gyro,  Ordnance,  Sound,  and  Radio  training 
--  also  offer  possibilities  as  predictors. 


4»  School  grades  seem  to  offer  the  most  by  way  of  possibilities 
for  improved  prediction.  For  this  reason,  it  would  seem 
most  reasonable  to  direct  further  work  toward  the  improve¬ 
ment  of  both  the  school  grade  and  the  criterion. 


